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Use of DG153 or DG177 secreted protein products for preventing and 
treating pancreatic diseases and/or obesity and/or metabolic syndrome 

Description 

This invention relates to the use of low molecular weight DG153 or DG177 
proteins, to the use of polynucleotides encoding these, and to the use of 
effectors/modulator thereof in the diagnosis, study, prevention, and treatment 
of pancreatic diseases (e.g. diabetes mellitus), obesity and/or metabolic 
syndrome and to the use in regeneration of tissues such as pancreatic tissues 
and others. 

' Many human proteins serve as pharmaceutical^ active compounds. Several 
classes of human proteins that serve as such active compounds include 
hormones, cytokines, cell growth factors, and cell differentiation factors. Most 
proteins that can be used as a pharmaceutical^ active compound fall within 
the family of secreted proteins. Secreted proteins are generally produced 
within cells at rough endoplasmic reticulum, are then exported to the golgi 
complex, and then move to secretory vesicles or granules, where they are 
secreted to the exterior of the cell via exocytosis. Examples for commercially 
used secreted proteins are human insulin, thrombolytic agents, interferons, 
interleukins, colony stimulating factors, human growth hormone, transforming 
growth factor beta, tissue plasminogen activator, erythropoeitin, and various 
other proteins. Receptors of secreted proteins, which are membrane-bound 
proteins, also have potential as therapeutic or diagnostic agents. It is, 
therefore, important for developing new pharmaceutical compounds to identify 
secreted proteins that can be tested for activity in a variety of animal models. 
Thus, in light of the pervasive role of secreted proteins in human physiology, a 
need exists for identifying and characterizing novel functions for human 
secreted proteins and the genes that encode them. This knowledge will allow 
one to detect, to treat, and to prevent medical diseases, disorders, and/or 
conditions by using secreted proteins or the genes that encode them. 

The pancreas is an essential organ possessing both an exocrine function 
involved in the delivery of enzymes into the digestive tract and an endocrine 
function by which various hormones are secreted into the blood stream. The 
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exocrine function is assured by acinar and centroacinar cells that produce 
various digestive enzymes and intercalated ducts that transport these enzymes 
in alkaline solution to the duodenum. The functional unit of the endocrine 
pancreas is the islet of Langerhans. Islets are scattered throughout the 
exocrine portion of the pancreas and are composed of four cell types: alpha-, 
beta-, delta- and PP-cells, reviewed for example in Kim S.K. and Hebrok M., 
(2001) Genes Dev. 15: 111-127. Beta-cells produce insulin, represent the 
majority of the endocrine cells and form the core of the islets, while alpha-cells 
secrete glucagon and are located in the periphery. Delta-cells and PP-cells are 
less numerous and secrete somatostatin and pancreatic polypeptide, 
respectively. 

Early pancreatic development has been well studied in different species, 
including chicken, zebrafish, and mice (for an detailed review, see Kim & 
Hebrock, 2001, supra). The pancreas develops from distinct dorsal and ventral 
anlagen. Pancreas development requires specification of the pancreas 
structure along both anterior-posterior and dorsal-ventral axes. A number of 
transcription factors, which are critical for proper pancreatic development have 
been identified (see Kim & Hebrok, 2001, supra; Wilson M.E. et al., (2003) 
Mech Dev. 120: 65-80). 

In postnatal/adult humans, the acinar and ductal cells retain a significant 
proliferative capacity that can ensure cell renewal and growth, whereas the 
islet cells become mostly mitotically inactive. This is in contrast to rodents 
where beta-cell replication is an important mechanism in the generation of new 
beta cells. It has been suggested, that during embryonic development, 
pancreatic islets of Langerhans originate from differentiating duct cells or other 
cells with epithelial morphology (Bonner-Weir S. and Sharma A., (2002) J 
Pathol. 197: 519-526; Gu G. et al., (2003) Mech Dev. 120: 35-43). In adult 
humans, new beta cells arise in the vicinity of ducts (Butler A.E. et al., (2003) 
Diabetes 52: 102-110; Bouwens L. and Pipeleers D.G., (1998) Diabetologia 
41: 629-633). However, also an intra-islet location or an origin in the bone 
marrow has been suggested for precursor cells of adult beta cells (Zulewski H. 
et al., (2001) Diabetes 50: 521-533; lanus A. et al., (2003) J Clin Invest. 111: 
843-850). Pancreatic islet growth is dynamic and responds to changes in 
insulin demand, such as during pregnancy or during the increase in body mass 
occuring during childhood. In adults, there is a good correlation between body 
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mass and islet mass (Yoon K.H. et al., (2003) J Clin Endocrinol Metab. 88: 
2300-2308). 

Pancreatic beta-cells secrete insulin, which is stimulated by high blood glucose 
levels. Insulin amongst other hormones plays a key role in the regulation of the 
fuel metabolism. Insulin leads to the storage of glycogen and triglycerides and 
to the synthesis of proteins. The entry of glucose into muscles and adipose 
cells is stimulated by insulin. In patients who suffer from diabetes mellitus the 
amount of insulin produced by the pancreatic islet cells is too low, resulting in 
elevated blood glucose levels (hyperglycemia). In diabetes type 1 beta cells 
are lost due. to autoimmune destruction. In type 2 diabetic patients, liver and 
muscle cells loose their ability to respond to normal blood insulin levels (insulin 
resistance). High blood glucose levels (and also high blood lipid levels) lead to 
an impairment of beta-cell function and to an increase in beta-cell apoptosis. It 
is interesting to note that the rate of beta-cell neogenesis does not appear to 
change in type 2 diabetics (Butler et al., 2003, supra), thus causing a reduction 
in total beta-cell mass over time. Eventually the application of exogenous 
insulin becomes necessary in type 2 diabetics. 

Improving metabolic parameters such as blood sugar and blood lipid levels 
(e.g. through dietary changes, exercise, medication or combinations thereof) 
before beta cell mass has fallen below a critical threshold leads to a relatively 
rapid restoration of beta cell function. However, after such a treatment the 
pancreatic endocrine function would remain impaired due to the only slightly 
increased regeneration rate. 

In type 1 diabetics, the lifespan of pancreatic islets is dramatically shortened 
due to autoimmune destruction. Treatments have been devised which 
modulate the immune system and may be able to stop or strongly reduce islet 
destruction (Raz I. et al., (2001) Lancet 358: 1749-1753; Chatenoud L. et al., 
(2003) Nat Rev Immunol. 3: 123-132). However, due to the relatively slow 
regeneration of human beta cells such treatments could only be fully 
successful at improving the diabetic condition if they are combined with an 
agent which can stimulate beta cell regeneration. Thus, both for type 1 and 
type 2 diabetes (early and late stages) there is a need to find novel agents 
which stimulate beta cell regeneration. 
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Diabetes is a very disabling disease, because medications do not control 
blood sugar levels well enough to prevent swinging between high and low 
blood sugar levels. Patients with diabetes are at risk for major complications, 
including diabetic ketoacidosis, end-stage renal disease, diabetic retinopathy 
and amputation. There are also a host of related conditions, such as metabolic 
syndrome, obesity, hypertension, heart disease, peripheral vascular disease, 
and infections, for which persons with diabetes are at substantially increased 
risk. The treatment of these complications contributes to a considerable 
degree to the enormous cost which is imposed by diabetes on health care 
systems world wide. 

Obesity is one of the most prevalent metabolic disorders in the world. It is still a 
poorly understood human disease that becomes as a major health problem 
more and more relevant for western society. Obesity is defined as a body 
weight more than 20% in excess of the ideal body weight, frequently resulting 
in a significant impairment of health. Obesity may be measured by body mass 
index, an indicator of adiposity or fatness. Further parameters for defining 
obesity are waist circumferences, skinfold thickness and bioimpedance. It is 
associated with an increased risk for cardiovascular disease, hypertension, 
diabetes mellitus type II, hyperlipidaemia and an increased mortality rate. 
Obesity is influenced by genetic, metabolic, biochemical, psychological, and 
behavioral factors and can be caused by different reasons such as non-insulin 
dependent diabetes, increase in triglycerides, increase in carbohydrate bound 
energy and low energy expenditure (Kopelman P.G., (2000) Nature 404: 
25 635-643). 

The concept of 'metabolic syndrome' (syndrome x, insulin-resistance 
syndrome, deadly quartet) was first described 1966 by Camus and 
reintroduced 1988 by Reaven (Camus JP, 1966, Rev Rhum Mai Osteoartic 33: 

30 10-14; Reaven et al. 1988, Diabetes, 37: 1595-1607). Today, metabolic 
syndrome is commonly defined as clustering of cardiovascular risk factors like 
hypertension, abdominal obesity, high blood levels of triglycerides and fasting 
glucose as well as low blood levels of HDL cholesterol. Insulin resistance 
greatly increases the risk of developing the metabolic syndrome (Reaven, 

35 2002, Circulation 106: 286-288). The metabolic syndrome often precedes the 
development of type 2 diabetes and cardiovascular disease (Lakka H.M., 
2002, JAMA 288: 2709-2716). The control of blood lipid levels and blood 
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giucose levels Is essential for the treatment of the metabolic syndrome (see, 
for example, Santomauro AT. et al., (1999) Diabetes, 48: 1836-1841). 

The molecular factors regulating food intake and body weight balance are 
incompletely understood. Even if several candidate genes have been 
described which are supposed to influence the homeostatic system(s) that 
regulate body mass/weight, like leptin or the peroxisome proliferator-activated 
receptor-gamma co-activator, the distinct molecular mechanisms and/or 
molecules influencing obesity or body weight/body mass regulations are not 
known. 

There is a need in the prior art for the identification of candidate genes that are 
specifically expressed in early development in certain pancreatic tissues. 
These genes and the thereby encoded proteins can provide tools to the 
diagnosis and treatment of severe pancreatic disorders and related diseases. 
Therefore, this invention describes secreted proteins that are specifically 
expressed in pancreatic tissues early in the development. The invention 
relates to the use of these genes and proteins in the diagnosis, prevention 
and/or treatment of pancreatic dysfunctions, such as diabetes, and other 
related diseases such as obesity and/or metabolic syndrome. These proteins 
and genes are especially useful in regeneration processes, such as 
regeneration of the pancreas cells. 

In this invention, we disclose secreted factors referred to as DG153 and 
DG177 which are involved in pancreas development, regeneration, and in the 
regulation of energy homeostasis. 

The mRNA encoding the low molecular weight DG153 protein was described 
to be expressed in human liver (see Kawamoto et al., Gene 1996 174: 
151-158). The highly conserved arginine-rich protein was shown to be mutated 
in different cancers, such as renal cell carcinomas, pancreatic, lung, breast, 
and prostate cancers, and in squamous cell carcinoma of the head and neck 
(Shridhar et al., Oncogene 12, 1931-1939 (1996), Shridhar et al. Cancer Res. 
56, 5576-5578 (1996), Shridhar et al, Oncogene 14, 2213-2216 (1997)). The 
mutations associated with these cancers appear to be centered in an imperfect 
trinucleotide repeat that putatively encodes a stretch of 15 to 18 arginines. The 
association of these mutations with cancer suggests that the putative protein 
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may be a growth factor that plays an important role in the regulation of cell 
growth and development. No further functional data are available in the 
scientific prior art for this protein. 

5 However, the DG153 protein and possible further uses in different disease 
indications are disclosed in several patent applications. For example, DG153 
is described as human shear stress response protein useful in the diagnosis, 
treatment and screening of vascular diseases caused by arteriosclerosis, 
including heart failure, post-PTCA restenosis and hypertension (see WO 

10 01/25427). Patent application WO 02/74956 describes a 98% identical protein 
as useful for treating neurodegenerative diseases (such as Parkinson's 
disease or Alzheimer's diseases). A 97% identical protein is described in WO 
02/90541 useful for diagnosing behavioral disorder, or assessing the likelihood 
of developing behavioral disorder (like Attention Deficit Hyperactivity Disorder 

15 or intellectual disorders). The precursor protein (179 amino acids) is described 
in patent application WO 01/19851 for treating nervous system diseases or 
disorders (such as Parkinson's disease), and for transplanting cells into 
nervous system. A 180 amino acid fragment of DG153 is described in WO 
01/70174 as being useful for modulating angiogenesis and/or apoptosis for 

20 preventing or treating cancer, myocardial infarction and promoting healing, by 
modulating the activity of vascular endothelial growth factor-modulated gene 
polypeptide. 

DG177 corresponds to human Follistatin-related protein 1 (FLRG), a member 
25 of the follistatin family of proteins. Members of this family contain at least one 
conserved 10-cysteine follistatin domain. In contrast to other members of this 
family, FLRG (and follistatin) share a common exon/intron domain structure, 
substantial primary sequence homology, and an ability to irreversibly bind 
activin. However, the FLRG and follistatin show also structural differences that 
30 are responsible for differing functions. Like follistatins, FLRG binds activins, 
which regulates cell growth and steroidogenesis in the human ovary. Other 
binding partners of FLRG are bone morphogenetic proteins (BMPs). For 
example, FLRG proteins downmodulate the effects of activin A and BMP2 on 
erythroid maturation. In addition, a role of FLRG in the regulation of activin A in 
35 brain wound healing was postulated. 
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The FLRG gene product is thought to be an autoantigen associated with 
rheumatoid arthritis (WO 97/17441, Tanaka et al., 1998, Int. Immunol. 10, 
1305-1314, WO 02/4831 0A2). FLRG and other genes were found to have 
polymorphism-associated effects on the suspectibility to low mineral bone 
density and/or bone damage, and hence osteoporosis (WO 03/054218). The 
expression of FLRG regulated in specific cancers (see, for example, WO 
03/042661, WO 03/027633, WO 02/12440, US2002014298A1, WO 
02/059377, WO 02/21996). 

To the best of our knowledge, no disclosure has been made in the prior art that 
describes a function of DG153 or DG177 in pancreatic disorders including 
diabetes, in obesity or metabolic syndrome. In addition, a role of the secreted 
factors DG153 or DG177 in the regeneration of tissues such as pancreatic 
tissues and others is novel and surprising. Thus, since secreted proteins are a 
major target for drug action and development, it is of high value to identify and 
characterize novel functions for secreted proteins. The present invention 
advances the state of the art by providing previously unknown functions for the 
low molecular weight human secreted proteins DG153 and DG177. This 
satisfies a need in the art by providing new therapeutic strategies for treating 
pancreatic diseases (e.g. diabetes), obesity, and/or metabolic syndrome, for 
example, treating diabetes by regeneration of beta cells. DG153 or DG177 can 
be used to promote a regeneration of, for exmple, pancreas cells which then 
start producing insulin on their own. Therefore, further discovery and 
development of such novel functions for secreted proteins like DG153 or 
DG177 would have a strong beneficial affect on medical services and 
healthcare. 

Accordingly, the present invention relates to secreted proteins with novel 
functions in the human metabolism, regeneration and pancreatic 
developmental processes. The present invention discloses specific genes and 
proteins encoded thereby and effectors/modulators thereof involved in the 
regulation of pancreatic function and metabolism, especially in pancreas 
diseases such as diabetes mellitus, e.g. insulin dependent diabetes mellitus 
and/or non-insulin dependent diabetes mellitus, and/or metabolic syndrome, 
obesity, and/or related disorders such as coronary heart disease, eating 
disorder, cachexia, hypertension, hypercholesterolemia (dyslipidemia), liver 
fibrosis, and/or gallstones. Further, the present invention discloses specific 
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genes and proteins encoded thereby and effectors/modulators thereof involved 
in the modulation, e.g. stimulation, of pancreatic development and/or 
regeneration of pancreatic cells or tissues, e.g. cells having exocrinous 
functions such as acinar cells, centroacinar cells and/or ductal cells, and/or 
cells having endocrinous functions, particularly cells in Langerhans islets such 
as alpha-, beta-, delta- and/or PP-cells, more particularly beta-cells. 

In this invention, we used a screen for secreted factors expressed in 
developing mammalian (mouse) pancreas, as described in more detail in the 
Examples section (see Example 1). This screen identified DG153 and DG177 
as secreted factors expressed in developing mouse pancreas. The present 
invention describes mammalian DG153 and DG177 proteins and the 
polynucleotides encoding these, in particular human DG153 and DG177, as 
being involved in the conditions and processes mentioned above. 

The present invention relates to DG153 or DG177 polynucleotides encoding 
polypeptides with novel functions in the development and regeneration of 
pancreatic tissues and thus in mammalian pancreatic diseases (e.g. diabetes), 
and also in body-weight regulation, energy homeostasis, and obesity, 
fragments of said polynucleotides, polypeptides encoded by said 
polynucleotides or fragments thereof. The invention also relates to vectors, 
host cells, and recombinant methods for producing the polypeptides and 
polynucleotides of the invention. The invention also relates to 
effectors/modulators of DG153 or DG177 polynucleotides and/or polypeptides, 
e.g. antibodies, biologically active nucleic acids, such as antisense molecules, 
RNAi molecules or ribo2ymes, aptamers, peptides or low-molecular weight 
organic compounds recognizing said polynucleotides or polypeptides. 

DG153 or DG177 homologous proteins and nucleic acid molecules coding 
therefore are obtainable from vertebrate species. Particularly preferred are 
nucleic acids encoding the human DG153 or DG177 proteins and variants 
thereof. The invention particularly relates to nucleic acid molecules encoding 
polypeptides contributing to regulating the energy homeostasis and the 
mammalian metabolism, wherein said nucleic acid molecules comprise 
(a) the nucleotide sequence of human DG153 (SEQ ID NO: 1) and/or 

human DG177 (SEQ ID NO: 4) and/or a sequence complementary 

thereto, 
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(b) a nucleotide sequence which hybridizes at 50°C in a solution containing 
1 x SSC and 0.1 % SDS to a sequence of (a), 

(c) a sequence corresponding to the sequences of (a) or (b) within the 
degeneration of the genetic code, 

s (d) a sequence which encodes a polypeptide which is at least 85%, 
preferably at least 90%, more preferably at least 95%, more preferably 
at least 98% and up to 99,6% identical to an amino acid sequence of 
the human DG153 protein (SEQ ID NO: 2 or 3) or DG177 protein (SEQ 
ID NO: 5), 

10 (e) a sequence which differs from the nucleic acid molecules of (a) to (d) by 
mutation and wherein said mutation causes an alteration, deletion, 
duplication and/or premature stop in the encoded polypeptide or 
(f) a partial sequence of any of the nucleotide sequences of (a) to (e) 
having a length of 15-25 bases, preferably 25-35 bases, more 

is preferably 35-50 bases and most preferably at least 50 bases. 

The function of the mammalian DG153 or DG177 in metabolism was validated 
by analyzing the expression of the transcripts in different tissues and by 
analyzing the role in adipocyte differentiation. 

20 

Expression profiling studies (see Examples for more detail) confirm the 
particular relevance of DG153 and DG177 as regulators of energy metabolism 
in mammals. 

25 Quantitative (Taqman) PCR analysis revealed that DG153 is expressed in 
several mammalian tissues, with highest expression levels in testis. In addition, 
DG153 is highly expressed in metabolic active tissue such as white adipose 
tissue (WAT) compared to other tissue types in wild type mouse as depicted in 
Fig. 2A. BAT is a well-characterized tissue which is well developed in newborn 

30 mammals, including humans. One important task of BAT is to generate heat 
and maintain body temperature homeostasis in newborn. Thus, an expression 
of DG153 protein in adipose tissues is confirming a role in the regulation of 
metabolism, particularly energy homeostasis and thermogenesis. 

35 We used mouse models of insulin resistance and/or diabetes, such as mice 
carrying gene knockouts in the leptin pathway (for example, ob/ob (leptin) or 
db (leptin receptor/ligand) mice) to study the expression of DG153. Such mice 
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develop typical symptoms of diabetes, show hepatic lipid accumulation and 
frequently have increased plasma lipid levels (see Bruning et al, 1998, Mol. 
Cell. 2: 559-569). We found, for example, that the expression of DG153 is 
strongly down-regulated in metabolic active tissue (WAT) in genetically 
5 induced obese mice (ob/ob) compared to fasted mice (see Fig. 2B). These 
data further support an essential role of DG153 in the regulation of the 
mammalian metabolism, particularly in processes related to obesity or 
metabolic syndrome. 

o Expression of DG153 mRNA was also examined in susceptible wild type mice 
(for example, C57BI/6) that show symptoms of diabetes, lipid accumulation, 
and high plasma lipid levels, if fed a high fat diet. In those mice, the expression 
of DG153 is significantly down-regulated in WAT supporting the finding in 
ob/ob mice. In addition, the expression of DG153 is up-regulated in BAT and 

is muscle tissues. These results confirm a role of DG153 is involved in the 
regulation of mammalian metabolism (see Fig. 2C). 

Further, we show (see Examples and Fig. 2D-2F) that the DG153 protein has 
to be down-regulated in order for the preadipocyctes to differentiate into 

20 mature adipocyctes. With regard to changes in expression intensity during the 
differentiation of preadipocytes to adipocytes, a slight reduction in relative 
signal intensity can be observed for DG153 during the in vitro differentiation 
program of 3T3-L1 (see Fig. 2D). Therefore, the DG153 protein might play an 
essential role in adipogenesis. The results are suggesting a role as modulator 

25 of adipogenesis. 

In mouse models of insulin resistance and/or diabetes expression, DG177 is 
significantly up-regulated in metabolic active tissues (BAT) and muscle which 
is supporting an essential role of DG177 in the regulation of the mammalian 

30 metabolism, particularly in processes related to, obesity, diabetes, or metabolic 
syndrome (Fig. 4B). We also found, that the expression of DG177 is down- 
regulated in WAT and liver of fasted wild-type mice, compared to wild type 
mice fed a standard diet (Fig. 4B). In addition, expression of DG177 mRNA is 
significantly up-regulated in WAT and muscle in mice showing symptoms of 

35 diabetes, lipid accumulation, and high plasma lipid levels, if fed a high fat diet 
(Fig. 4C). In non-obese diabetic (NOD) mice, expression of DG177 is 
significantly down-regulated in white adipose tissue compared to wild type 
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mice, as shown in Fig. 4D. Thus, DG177 protein is playing a role in the 
regulation of metabolism, particularly energy homeostasis and thermogenesis. 

We show (see Examples and Fig. 4E) that the DG177 mRNA is down- 
5 regulated during differentiation of preadipocyctes into mature adipocytes. The 
results are suggesting a role of DG177 as modulator of adipogenesis. 

Microarrays are analytical tools routinely used in bioanalysis. A microarray has 
molecules distributed over, and stably associated with, the surface of a solid 

io support. The term "microarray" refers to an arrangement of a plurality of 
polynucleotides, polypeptides, antibodies, or other chemical compounds on a 
substrate. Microarrays of polypeptides, polynucleotides, and/or antibodies 
have been developed and find use in a variety of applications, such as 
monitoring gene expression, drug discovery, gene sequencing, gene mapping, 

15 bacterial identification, and combinatorial chemistry. One area in particular in 
which microarrays find use is in gene expression analysis (see Example 4). 
Array technology can be used to explore the expression of a single 
polymorphic gene or the expression profile of a large number of related or 
unrelated genes. When the expression of a single gene is examined, arrays 

20 are employed to detect the expression of a specific gene or its variants. When 
an expression profile is examined, arrays provide a platform for identifying 
genes that are tissue specific, are affected by a substance being tested in a 
toxicology assay, are part of a signaling cascade, carry out housekeeping 
functions, or are specifically related to a particular genetic predisposition, 

25 condition, disease, or disorder. 

Microarrays may be prepared, used, and analyzed using methods known in the 
art (see for example, Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; 
Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 93: 10614-10619; 

so Baldeschweiler et al., PCT application WO 95/251 116; Shalon, D. et al., POT 
application WO 95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. 94: 
2150-2155; Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662). Various types 
of microarrays are well known and thoroughly described in Schena, M., ed. 
(1999; DNA Microarrays: A Practical Approach, Oxford University Press, 

35 London). 
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Oligonucleotides or longer fragments derived from any of the polynucleotides 
described herein may be used as elements on a microarray. The microarray 
ran be used in transcript imaging techniques which monitor the relative 
expression levels of large numbers of genes simultaneously as described 

5 below. The microarray may also be used to identify genetic variants, 
mutations, and polymorphisms. This information may be used to determine 
gene function, to understand the genetic basis of a disorder, to diagnose a 
disorder, to monitor progression/regression of disease as a function of gene 
expression, and to develop and monitor the activities of therapeutic agents in 

o the treatment of disease. In particular, this information may be used to develop 
a pharmacogenomic profile of a patient in order to select the most appropriate 
and effective treatment regimen for that patient. For example, therapeutic 
agents which are highly effective and display the fewest side effects may be 
selected for a patient based on his/her pharmacogenomic profile. 

15 

As determined by microarray analysis, DG153 shows differential expression in 
human primary adipocytes. A downregulation is observed concerning the 
expression of DG153 is during the murine 3T3-L1 (Fig. 2E) and human 
adipocyte differentiation (see Fig. 2F). The DG153 protein in preadipocyctes 

20 has the potential to inhibit adipose differentiation at a very early stage. 
Therefore, the DG153 protein might play an essential role in adipogenesis. 
The results are suggesting a role of DG153 in the regulation in human 
metabolism, for example, as effector/modulator (for example, inhibitor) of 
adipogenesis. Thus, DG153 is a strong candidate for the manufacture of a 

25 pharmaceutical composition and a medicament for the treatment of conditions 
related to human metabolism, such as diabetes, obesity, and/or metabolic 
syndrome. 

The invention also encompasses polynucleotides that encode the proteins of 
30 the invention and homologous proteins. Accordingly, any nucleic acid 
sequence, which encodes the amino acid sequences of the proteins of the 
invention and homologous proteins, can be used to generate recombinant 
molecules that express the proteins of the invention and homologous proteins. 
In a particular embodiment, the invention encompasses nucleic acids encoding 
35 DG153 or DG177. It will be appreciated by those skilled in the art that as a 
result of the degeneracy of the genetic code, a multitude of nucleotide 
sequences encoding the proteins, some bearing minimal homology to the 
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nucleotide sequences of any known and naturally occurring gene, may be 
produced. The invention contemplates each and every possible variation of 
nucleotide sequence that can be made by selecting combinations based on 
possible codon choices. 

5 

Also encompassed by the invention are polynucleotide sequences that are 
capable of hybridizing to the claimed nucleotide sequences, and in particular, 
those of the polynucleotide encoding the proteins of the invention, under 
various conditions of stringency. Hybridization conditions are based on the 

10 melting temperature (Tm) of the nucleic acid binding complex or probe, as 
taught in Wahl & Berger (1987: Methods Enzymol, 152: 399-407) and Kimmel 
(1987; Methods Enzymol. 152: 507-511), and may be used at a defined 
stringency. Preferably, hybridization under stringent conditions means that 
after washing for 1 h with 1 x SSC and 0.1% SDS at 50°C, preferably at 55°C, 

is more preferably at 62°C and most preferably at 65°C, particularly for 1 h in 0.2 
x SSC and 0.1% SDS at 50°C, preferably at 55°C, more preferably at 62°C 
and most preferably at 65°C, a positive hybridization signal is observed. 
Altered nucleic acid sequences encoding the proteins which are encompassed 
by the invention include deletions, insertions or substitutions of different 

20 nucleotides resulting in a polynucleotide that encodes the same or a 
functionally equivalent protein. 

The encoded proteins may also contain deletions, insertions or substitutions of 
amino acid residues, which produce a silent change and result in functionally 

25 equivalent proteins. Deliberate amino acid substitutions may be made on the 
basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, 
and/or the amphipathic nature of the residues as long as the biological activity 
of the protein is retained. Furthermore, the invention relates to peptide 
fragments of the proteins or derivatives thereof such as cyclic peptides, 

30 retro-inverso peptides or peptide mimetics having a length of at least 4, 
preferably at least 6 and up to 50 amino acids. 

Also included within the scope of the present invention are alleles of the genes 
encoding the proteins of the invention and homologous proteins. As used 
35 herein, an 'allele' or 'allelic sequence' is an alternative form of the gene, which 
may result from at least one mutation in the nucleic acid sequence. Alleles may 
result in altered mRNAs or polypeptides whose structures or function may or 
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may not be altered. Any given gene may have none, one or many allelic forms. 
Common mutational changes, which give rise to alleles, are generally ascribed 
to natural deletions, additions or substitutions of nucleotides. Each of these 
types of changes may occur alone or in combination with the others, one or 
more times in a given sequence. 

The nucleic acid sequences encoding DG153 or DG177 and homologous 
proteins may be extended utilizing a partial nucleotide sequence and 
employing various methods known in the art to detect upstream sequences 
such as promoters and regulatory elements. 

In order to express a biologically active protein, the nucleotide sequences 
encoding the proteins or functional equivalents, may be inserted into 
appropriate expression vectors, i.e., a vector which contains the necessary 
elements for the transcription and translation of the inserted coding sequence. 
Methods, which are well known to those skilled in the art, may be used to 
construct expression vectors containing sequences encoding the proteins and 
the appropriate transcriptional and translational control elements. Regulatory 
elements include for example a promoter, an initiation codon, a stop codon, a 
mRNA stability regulatory element, and a polyadenylation signal. Expression of 
a polynucleotide can be assured by (i) constitutive promoters such as the 
Cytomegalovirus (CMV) promoter/enhancer region, (ii) tissue specific 
promoters such as the insulin promoter (see, Soria et al., 2000, Diabetes 49: 
157), SOX2 gene promoter (see Li et al., (1998) Curr. Biol. 8: 971-974), Msi-1 
promotor (see Sakakibara et al., (1997) J. Neuroscience 17: 8300-8312), 
alpha-cardia myosin heavy chain promotor or human atrial natriuretic factor 
promotor (Klug et al., (1996) J. Clin. Invest 98: 216-224; Wu et al., (1989) J. 
Biol. Chem. 264: 6472-6479) or (iii) inducible promoters such as the 
tetracycline inducible system. Expression vectors can also contain a selection 
agent or marker gene that confers antibiotic resistance such as the neomycin, 
hygromycin or puromycin resistance genes. These methods include in vitro 
recombinant DNA techniques, synthetic techniques, and in vivo genetic 
recombination. Such techniques are described in Sambrook, J. et al. (1989) 
Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview, 
N.Y. and Ausubel, F.M. et al. (1989) Current Protocols in Molecular Biology, 
John Wiley & Sons, New York, N.Y. 
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In a further embodiment of the invention, natural, modified or recombinant 
nucleic acid sequences encoding the proteins of the invention and 
homologous proteins may be ligated to a heterologous sequence to encode a 
fusion protein. 

5 

A variety of expression vector/host systems, as known in the art, may be 
utilized to contain and express sequences encoding the proteins or fusion 
proteins. These include, but are not limited to, micro-organisms such as 
bacteria transformed with recombinant bacteriophage, plasmid or cosmid DNA 

10 expression vectors; yeast transformed with yeast expression vectors; insect 
cell systems infected with virus expression vectors (e.g., baculovirus, 
adenovirus, adeno-associated virus, lentiverus, retrovirus); plant cell systems 
transformed with virus expression vectors (e.g., cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., 

is Tl or PBR322 plasmids); or animal cell systems. 

The presence of polynucleotide sequences of the invention in a sample can be 
detected by DNA-DNA or DNA-RNA hybridization and/or amplification using 
probes or portions or fragments of said polynucleotides. Nucleic acid 

20 amplification based assays involve the use of oligonucleotides or oligomers 
based on the sequences specific for the gene to detect transformants 
containing DNA or RNA encoding the corresponding protein. As used herein 
'oligonucleotides' or 'oligomers' refer to a nucleic acid sequence of at least 
about 10 nucleotides and as many as about 60 nucleotides, preferably about 

25 15 to 30 nucleotides, and more preferably about 20-25 nucleotides, which can 
be used as a probe or amplimer. 

A wide variety of labels and conjugation techniques are known by those skilled 
in the art and may be used in various nucleic acid and amino acid assays. 

30 Means for producing labeled hybridization or PCR probes for detecting 
polynucleotide sequences include oligo-labeling, nick translation, end-labeling 
of RNA probes, PCR amplification using a labeled nucleotide, or enzymatic 
synthesis. These procedures may be conducted using a variety of 
commercially available kits (Pharmacia & Upjohn, (Kalamazoo, Mich.); 

35 Promega (Madison Wis.); and U.S. Biochemical Corp., (Cleveland, Ohio). 

The presence of DG153 or DG177 in a sample can be determined by 
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Immunological methods or activity measurement. A variety of protocols for 
detecting and measuring the expression of proteins, using either polyclonal or 
monoclonal antibodies specific for the protein or reagents for determining 
protein activity are known in the art. Examples include enzyme-linked 

5 immunosorbent assay (ELISA), radioimmunoassay (RIA), and fluorescence 
activated cell sorting (FACS). A two-site, monoclonal-based immunoassay 
utilizing monoclonal antibodies reactive to two non-interfering epitopes on the 
protein is preferred, but a competitive binding assay may be employed. These 
and other assays are described, among other places, in Hampton, R. et al. 

10 (1990; Serological Methods, a Laboratory Manual, APS Press, St Paul, Minn.) 
and Maddox, D.E. etai. (1983; J. Exp. Med. 158: 1211-1226). 

Suitable reporter molecules or labels, which may be used, include 
radionuclides, enzymes, fluorescent, chemiluminescent or chromogenic agents 
15 as well as substrates, co-factors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding a protein of the 
invention may be cultured under conditions suitable for the expression and 
recovery of said protein from cell culture. The protein produced by a 

20 recombinant cell may be secreted or contained intracellularly depending on the 
sequence or/and the vector used. As will be understood by those of skill in the 
art, expression vectors containing polynucleotides, which encode the protein 
may be designed to contain signal sequences, which direct secretion of the 
protein through a prokaryotic or eukaryotic cell membrane. Other recombinant 

25 constructions may be used to join sequences encoding the protein to 
nucleotide sequence encoding a polypeptide domain, which will facilitate 
purification of soluble proteins. Such purification facilitating domains include, 
but are not limited to, metal chelating peptides such as histidine-tryptophan 
modules that allow purification on immobilized metals, protein A domains that 

30 allow purification on immobilized immunoglobulin, and the domain utilized in 
the FLAG extension/affinity purification system (Immunex Corp., Seattle, 
Wash.) The inclusion of cleavable linker sequences such as those specific for 
Factor XA or Enterokinase (Invitrogen, San Diego, Calif.) between the 
purification domain and the desired protein may be used to facilitate 

35 purification. 
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The data disclosed in this invention show that the DG153 and DG177 nucleic 
acids and proteins and effector/modulator molecules thereof are useful in 
diagnostic and therapeutic applications implicated, for example, but not limited 
to, pancreatic diseases (e.g. diabetes mellitus such as insulin dependent 

s diabetes mellitus and/or non insulin dependent diabetes mellitus), obesity, 
metabolic syndrome, eating disorder, cachexia, hypertension, coronary heart 
disease, hypercholesterolemia (dyslipidemia), and/or gallstones. Further, the 
data show that the DG153 and DG177 nucleic acids and proteins and 
effector/modulator molecules thereof are useful for the modulation, e.g. 

10 stimulation, of pancreatic development and/or for the regeneration of 
pancreatic cells or tissues, e.g. cells having exocrinous functions such as 
acinar cells, centroacinar cells and/or ductal cells, and/or cells having 
endocrinous functions, particularly cells in Langerhans islets such as alpha-, 
beta-, delta- and/or PP-cells, more particularly beta-cells. Hence, diagnostic 

15 and therapeutic uses for the proteins of the invention nucleic acids and 
proteins of the invention are, for example but not limited to, the following: (i) 
tissue regeneration in vitro and in vivo (regeneration for all these tissues and 
cell types composing these tissues and cell types derived from these tissues), 
(ii) small molecule drug target, (Hi) antibody target (therapeutic, diagnostic, 

20 drug targeting/cytotoxic antibody), (iv) diagnostic and/or prognostic marker, (v) 
protein therapy, (vi) gene therapy (gene delivery/gene ablation), and / or (vii) 
research tools. 

For example, but not limited to, cDNAs encoding the proteins of the invention 
25 and particularly their human homologues may be useful in stimulating, 
enhancing or regulating the regeneration of tissues, and the proteins of the 
invention and particularly their human homologues may be useful when 
administered to a subject in need thereof. By way of non-limiting example, the 
compositions of the present invention will have efficacy for treatment of 
30 patients suffering from, for example, pancreatic diseases (e.g. diabetes), 
obesity, and/or metabolic syndrome as described above. 

In one embodiment of the invention, administration of DG153 and/or DG177 
nucleic acids and proteins and/or effectors/modulators thereof in a 
35 pharmaceutical composition to a subject in need thereof, particularly a human 
patient, leads to an at least partial regeneration of, for example, pancreas 
cells. The composition will then at least partially restore normal pancreatic 
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function. In one example, these cells are beta cells of the islets which then 
start producing insulin on their own. After administration of this composition 
e.g. on a regular basis, the beta cells of a diabetic subject will grow back to 
approach the normal size and number present in a nondiabetic subject. This 

5 effect upon the body reverses the condition of diabetes. As the subject's blood 
sugar level improves, the dosage administered may be reduced in strength. In 
at least some cases further administration can be discontinued entirely and the 
subject continues to produce a normal amount of insulin without further 
treatment. The subject is thereby not only treated but cured entirely of a 

10 diabetic condition. Further, beta cells or precursors thereof may be treated in 
vitro and implanted or reimplanted into a subject in need thereof. Furthermore, 
other cells of the pancreas can be regenerated in vivo and/or in vitro to cure a 
certain condition. 

is. Beside diabetes, the compositions of the present invention will also have 
efficacy for treatment of patients with other pancreatic diseases such as 
pancreatic cancer, dysplasia or pancreatitis. 

The DG153 or DG177 nucleic acids and proteins and effectors/modulators 
20 thereof are useful in diagnostic and therapeutic applications implicated in 
various embodiments as described below. For example, but not limited to, 
cDNAs encoding the proteins of the invention and particularly their human 
homologues may be useful in gene therapy, and the proteins of the invention 
and particularly their human homologues may be useful when administered to 
25 a subject in need thereof. By way of non-limiting example, the compositions of 
the present invention will have efficacy for treatment of patients suffering from, 
for example, pancreatic diseases (e.g. diabetes), obesity, and/or metabolic 
syndrome as described above. 

30 The nucleic acids of the invention or fragments thereof, may further be useful 
in diagnostic applications, wherein the presence or amount of the nucleic acids 
or the proteins are to be assessed. Further antibodies that bind 
immunospecifically to the novel substances of the invention may be used in 
therapeutic or diagnostic methods. 

35 

For example, in one aspect, antibodies, which are specific for the proteins of 
the invention and homologous proteins, may be used directly as an 
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effector/modulator, e.g. an antagonist or an agonist, or indirectly as a targeting 
or delivery mechanism for bringing a pharmaceutical agent to cells or tissue 
which express the protein. The antibodies may be generated using methods 
that are well known in the art. Such antibodies may include, but are not limited 
to, polyclonal, monoclonal, chimeric single chain, Fab fragments, and 
fragments produced by a Fab expression library. Neutralising antibodies, (i.e., 
those which inhibit dimer formation) are especially preferred for therapeutic 
use. 

For the production of antibodies, various hosts including goats, rabbits, rats, 
mice, humans, and others, may be immunized by injection with the protein or 
any fragment or oligopeptide thereof which has immunogenic properties. 
Depending on the host species, various adjuvants may be used to increase 
immunological response. It is preferred that the peptides, fragments or 
oligopeptides used to induce antibodies to the protein have an amino acid 
sequence consisting of at least five amino acids, and more preferably at least 
10 amino acids. 

Monoclonal antibodies to the proteins may be prepared using any technique 
that provides for the production of antibody molecules by continuous cell lines 
in culture. These include, but are not limited to, the hybridoma technique, the 
human B-cell hybridoma technique, and the EBV-hybridoma technique 
(Kohler, G. and Milstein C, (1975) Nature 256: 495-497; Kozbor, D. et al. 
(1985) J. Immunol. Methods 81: 31-42; Cote, R.J. et al., (1983) Proc. Natl. 
Acad. Sci. 80: 2026-2030; Cole, S.P. et al., (1984) Mol. Cell Biochem. 62: 
109-120). 

In addition, techniques developed for the production of 'chimeric antibodies', 
the splicing of mouse antibody genes to human antibody genes to obtain a 
molecule with appropriate antigen specificity and biological activity can be 
used (Morrison, S.L. et al., (1984) Proc. Natl. Acad. Sci. 81: 6851-6855; 
Neuberger, M.S. et al (1984) Nature 312: 604-608; Takeda, S. et al. (1985) 
Nature 314: 452-454). Alternatively, techniques described for the production of 
single chain antibodies may be adapted, using methods known in the art, to 
produce single chain antibodies specific for the proteins of the invention and 
homologous proteins. Antibodies with related specificity, but of distinct idiotypic 
composition, may be generated by chain shuffling from random combinatorial 
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immunoglobulin libraries (Kang A.S. et al. v (1991) Proc. Natl. Acad. Sci. 88: 
11120-11123). Antibodies may also be produced by inducing in vivo 
production in the lymphocyte population or by screening recombinant 
immunoglobulin libraries or panels of highly specific binding reagents as 
disclosed in the literature (Orfandi, R. et al. (1989) Proc. Natl. Acad. Sci. 86: 
3833-3837; Winter, G. and Milstein C, (1991) Nature 349: 293-299). 

Antibody fragments, which contain specific binding sites for the proteins may 
also be generated. For example, such fragments include, but are not limited to, 
the F(ab') 2 fragments which can be produced by Pepsin digestion of the 
antibody molecule and the Fab fragments which can be generated by reducing 
the disulfide bridges of F(ab') 2 fragments. Alternatively, Fab expression 
libraries may be constructed to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity (Huse, W.D. et al. 
(1989) Science 246: 1275-1281). 

Various immunoassays may be used for screening to identify antibodies 
having the desired specificity. Numerous protocols for competitive binding and 
immunoradiometric assays using either polyclonal or monoclonal antibodies 
with established specificities are well known in the art. Such immunoassays 
typically involve the measurement of complex formation between the protein 
and its specific antibody. A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering protein epitopes are 
preferred, but a competitive binding assay may also be employed (Maddox, 
supra). 

In another embodiment of the invention, the polynucleotides or fragments 
thereof or nucleic acid effector/modulator molecules such as antisense 
molecules, aptamers, RNAi molecules or ribozymes may be used for 
therapeutic purposes. In one aspect, aptamers, i.e. nucleic acid molecules, 
which are capable of binding to a protein of the invention and modulating its 
activity, may be generated by a screening and selection procedure involving 
the use of combinatorial nucleic acid libraries. 

In a further aspect, antisense molecules may be used in situations in which it 
would be desirable to block the transcription of the mRNA. In particular, cells 
may be transformed with sequences complementary to polynucleotides 
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encoding DG153 or DG177 and/or homologous proteins. Thus, antisense 
molecules may be used to modulate/effect protein activity or to achieve 
regulation of gene function. Such technology is now well known in the art, and 
sense or antisense oligomers or larger fragments, can be designed from 
various locations along the coding or control regions of sequences encoding 
the proteins. Expression vectors derived from retroviruses, adenovirus, herpes 
or vaccinia viruses or from various bacterial plasmids may be used for delivery 
of nucleotide sequences to the targeted organ, tissue or cell population. 
Methods, which are well known to those skilled in the art, can be used to 
construct recombinant vectors, which will express antisense molecules 
complementary to the polynucleotides of the genes encoding the proteins of 
the invention and homologous proteins. These techniques are described both 
in Sambrook et al. (supra) and in Ausubel et al. (supra). Genes encoding the 
proteins of the invention and homologous proteins can be turned off by 
transforming a cell or tissue with expression vectors^ which express high levels 
of polynucleotides that encode the proteins of the invention and homologous 
proteins or fragments thereof. Such constructs may be used to introduce 
untranslatable sense or antisense sequences into a cell. Even in the absence 
of integration into the DNA, such vectors may continue to transcribe RNA 
molecules until they are disabled by endogenous nucleases. Transient 
expression may last for a month or more with a non-replicating vector and 
even longer if appropriate replication elements are part of the vector system. 

As mentioned above, modifications of gene expression can be obtained by 
designing antisense molecules, e.g. DNA, RNA or nucleic acid analogues such 
as PNA, to the control regions of the genes encoding DG153 or DG177 and/or 
homologous proteins, i.e., the promoters, enhancers, and introns. 
Oligonucleotides derived from the transcription initiation site, e.g., between 
positions -10 and +10 from the start site, are preferred. Similarly, inhibition can 
be achieved using "triple helix" base-pairing methodology. Triple helix pairing 
is useful because it cause inhibition of the ability of the double helix to open 
sufficiently for the binding of polymerases, transcription factors or regulatory 
molecules. Recent therapeutic advances using triplex DNA have been 
described in the literature (Gee, J.E. et al. (1994) Gene 149: 109-114; Huber, 
B.E. and Carr B.I., Molecular and Immunologic Approaches, Futura Publishing 
Co., Mt. Kisco, N.Y.). The antisense molecules may also be designed to block 
translation of mRNA by preventing the transcript from binding to ribosomes. 
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Ribozymes, enzymatic RNA molecules, may also be used to catalyze the 
specific cleavage of RNA. The mechanism of ribozyme action involves 
sequence-specific hybridization of the ribozyme molecule to complementary 
target RNA, followed by endonucleolytic cleavage. Examples, which may be 
used, include engineered hammerhead motif ribozyme molecules that can be 
specifically and efficiently catalyze endonucleolytic cleavage of sequences 
encoding the proteins of the invention and homologous proteins. Specific 
ribozyme cleavage sites within any potential RNA target are initially identified 
by scanning the target molecule for ribozyme cleavage sites which include the 
following sequences: GUA, GUU, and GUC. Once identified, short RNA 
sequences of between 15 and 20 ribonucleotides corresponding to the region 
of the target gene containing the cleavage site may be evaluated for 
secondary structural features which may render the oligonucleotide 
inoperable. The suitability of candidate targets may also be evaluated by 
testing accessibility to hybridization with complementary oligonucleotides using 
ribonuclease protection assays. 

Nucleic acid effector/modulator molecules, e.g. antisense molecules and 
ribozymes may be prepared by any method known in the art for the synthesis 
of nucleic acid molecules. These include techniques for chemically 
synthesizing oligonucleotides such as solid phase phosphoramidite chemical 
synthesis. Alternatively, RNA molecules may be generated by in vitro and in 
vivo transcription of DNA sequences. Such DNA sequences may be 
incorporated into a variety of vectors with suitable RNA polymerase promoters 
such as T7 or SP6. Alternatively, these cDNA constructs that synthesize 
antisense RNA constitutively or inducibly can be introduced into cell lines, cells 
or tissues. RNA molecules may be modified to increase intracellular stability 
and half-life. Possible modifications include, but are not limited to, the addition 
of flanking sequences at the 5' and/or 3' ends of the molecule or modifications 
in the nucleobase, sugar and/or phosphate moieties, e.g. the use of 
phosphorothioate or 2' O-methyl rather than phosphodiesterase linkages 
within the backbone of the molecule. This concept is inherent in the production 
of PNAs and can be extended in all of these molecules by the inclusion of 
non-traditional bases such as inosine, queosine, and wybutosine, as well as 
acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recognized by 
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endogenous endonucleases. 

Many methods for introducing vectors into cells or tissues are available and 
equally suitable for use in vivo, in vitro, and ex vivo. For ex vivo therapy, 

5 vectors may be introduced into stem cells taken from the patient and clonally 
propagated for autologous transplant back into that same patient. Delivery by 
transfection and by liposome injections may be achieved using methods, which 
are well known in the art. Any of the therapeutic methods described above may 
be applied to any suitable subject including, for example, mammals such as 

10 dogs, cats, cows, horses, rabbits, monkeys, and most preferably, humans. 

An additional embodiment of the invention relates to the administration of a 
pharmaceutical composition, in conjunction with a pharmaceutical^ acceptable 
carrier, for any of the therapeutic effects discussed above. Such 

15 pharmaceutical compositions may consist of DG153 or DG177 nucleic acids 
and the proteins and homologous nucleic acids or proteins, antibodies to the 
proteins of the invention and homologous proteins, mimetics, agonists, 
antagonists or inhibitors of the proteins of the invention and homologous 
proteins or nucleic acids. The compositions may be administered alone or in 

20 combination with at least one other agent, such as stabilizing compound, which 
may be administered in any sterile, biocompatible pharmaceutical carrier, 
including, but not limited to, saline, buffered saline, dextrose, and water. The 
compositions may be administered to a patient alone or in combination with 
other agents, drugs or hormones. The pharmaceutical compositions utilized in 

25 this invention may be administered by any number of routes including, but not 
limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, 
intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, 
intranasal, enteral, topical, sublingual or rectal means. 

30 In addition to the active ingredients, these pharmaceutical compositions may 
contain suitable pharmaceutically-acceptable carriers comprising excipients 
and auxiliaries, which facilitate processing of the active compounds into 
preparations, which can be used pharmaceutical^. Further details on 
techniques for formulation and administration may be found in the latest edition 

35 of Remington's Pharmaceutical Sciences (Maack Publishing Co., Easton, Pa.). 

Pharmaceutical compositions suitable for use in the invention include 
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compositions wherein the active ingredients are contained in an effective 
amount to achieve the intended purpose. The determination of an effective 
dose is well within the capability of those skilled in the art. For any compounds, 
the therapeutically effective dose can be estimated initially either in cell culture 
assays, e.g., of preadipocyte cell lines or in animal models, usually mice, 
rabbits, dogs or pigs. The animal model may also be used to determine the 
appropriate concentration range and route of administration. Such information 
can then be used to determine useful doses and routes for administration in 
humans. A therapeutically effective dose refers to that amount of active 
ingredient, for example the DG153 or DG177 nucleic acids or proteins or 
fragments thereof or antibodies, which is sufficient for treating a specific 
condition. Therapeutic efficacy and toxicity may be determined by standard 
pharmaceutical procedures in cell cultures or experimental animals, e.g., ED50 
(the dose therapeutically effective in 50% of the population) and LD50 (the 
dose lethal to 50% of the population). The dose ratio between therapeutic and 
toxic effects is the therapeutic index, and it can be expressed as the ratio, 
LD50/ED50. Pharmaceutical compositions, which exhibit large therapeutic 
indices, are preferred. The data obtained from cell culture assays and animal 
studies is used in formulating a range of dosage for human use. The dosage 
contained in such compositions is preferably within a range of circulating 
concentrations that include the ED50 with little or no toxicity. The dosage 
varies within this range depending upon the dosage from employed, sensitivity 
of the patient, and the route of administration. The exact dosage will be 
determined by the practitioner, in light of factors related to the subject that 
requires treatment. Dosage and administration are adjusted to provide 
sufficient levels of the active moiety or to maintain the desired effect. Factors, 
which may be taken into account, include the severity of the disease state, 
general health of the subject, age, weight, and gender of the subject, diet, time 
and frequency of administration, drug combination(s), reaction sensitivities, 
and tolerance/response to therapy. Long-acting pharmaceutical compositions 
may be administered every 3 to 4 days, every week or once every two weeks 
depending on half-life and clearance rate of the particular formulation. Normal 
dosage amounts may vary from 0.1 to 100,000 microg, up to a total dose of 
about 1 g, depending upon the route of administration. Guidance as to 
particular dosages and methods of delivery is provided in the literature and 
generally available to practitioners in the art. Those skilled in the art employ 
different formulations for nucleotides than for proteins or their inhibitors. 
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Similariy, delivery of polynucleotides or polypeptides will be specific to 
particular cells, conditions, locations, etc. 

In another embodiment, antibodies which specifically bind to the proteins may 
be used for the diagnosis of conditions or diseases characterized by or 
associated with over- or under-expression of the proteins of the invention and 
homologous proteins or in assays to monitor patients being treated with the 
proteins of the invention and homologous proteins, or effectors / modulators 
thereof, e.g. agonists, antagonists, or inhibitors. Diagnostic assays include 
methods which utilize the antibody and a label to detect the protein in human 
body fluids or extracts of cells or tissues. The antibodies may be used with or 
without modification, and may be labeled by joining them, either covalently or 
non-covalently, with a reporter molecule. A wide variety of reporter molecules, 
which are known in the art may be used several of which are described above. 

A variety of protocols including ELISA, RIA, and FACS for measuring proteins 
are known in the art and provide a basis for diagnosing altered or abnormal 
levels of gene expression. Normal or standard values for gene expression are 
established by combining body fluids or cell extracts taken from normal 
mammalian subjects, preferably human, with antibodies to the protein under 
conditions suitable for complex formation. The amount of standard complex 
formation may be quantified by various methods, but preferably by photometric 
means. Quantities of protein expressed in control and disease, samples e.g. 
from biopsied tissues are compared with the standard values. Deviation 
between standard and subject values establishes the parameters for 
diagnosing disease. 

In another embodiment of the invention, the polynucleotides specific for the 
DG153 or DG177 proteins and homologous proteins may be used for 
diagnostic purposes. The polynucleotides, which may be used, include 
oligonucleotide sequences, antisense RNA and DNA molecules, and PNAs. 
The polynucleotides may be used to detect and quantitate gene expression in 
biopsied tissues in which gene expression may be correlated with disease. 
The diagnostic assay may be used to distinguish between absence, presence, 
and excess gene expression, and to monitor regulation of protein levels during 
therapeutic intervention. 



WO 2005/005471 



PCT/EP2004/007531 



-26- 

In one aspect, hybridization with probes which are capable of detecting 
polynucleotide sequences, including genomic sequences, encoding the 
proteins of the invention and homologous proteins or closely related 
molecules, may be used to identify nucleic acid sequences which encode the 
respective protein. The hybridization probes of the subject invention may be 
DNA or RNA and are preferably derived from the nucleotide sequence of the 
polynucleotide encoding the proteins of the invention or from a genomic 
sequence including promoter, enhancer elements, and introns of the naturally 
occurring gene. Hybridization probes may be labeled by a variety of reporter 
groups, for example, radionuclides such as 32 P or 35 S or enzymatic labels, such 
as alkaline phosphatase coupled to the probe via avidin/biotin coupling 
systems, and the like. 

Polynucleotide sequences specific for DG153 or DG177 proteins and 
homologous nucleic acids may be used for the diagnosis of conditions or 
diseases, which are associated with the expression of the proteins. Examples 
of such diseases include the pancreatic diseases (e.g. diabetes), obesity, 
metabolic syndrome, and/or others. Polynucleotide sequences specific for the 
DG153 or DG177 proteins and homologous proteins may also be used to 
monitor the progress of patients receiving treatment for pancreatic diseases 
(e.g. diabetes), obesity, and/or metabolic syndrome. The polynucleotide 
sequences may be used qualitative or quantitative assays, e.g. in Southern or 
Northern analysis, dot blot or other membrane-based technologies; in PGR 
technologies; or in dip stick, pin, ELISA or chip assays utilizing fluids or tissues 
from patient biopsies to detect altered gene expression. 

In a particular aspect, the DG153 or DG177 nucleotide sequences may be 
useful in assays that detect activation or induction of various metabolic 
diseases or dysfunctions. The nucleotide sequences may be labeled by 
standard methods, and added to a fluid or tissue sample from a patient under 
conditions suitable for the formation of hybridization complexes. After a 
suitable incubation period, the sample is washed and the signal is quantitated 
and compared with a standard value. The presence of altered levels of 
nucleotide sequences encoding the proteins of the invention and homologous 
proteins in the sample indicates the presence of the associated disease. Such 
assays may also be used to evaluate the efficacy of a particular therapeutic 
treatment regimen in animal studies, in clinical trials or in monitoring the 
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treatment of an individual patient 

in order to provide a basis for the diagnosis of a disease associated with 
expression of the DG153 or DG177 proteins and homologous proteins, a 

5 normal or standard profile for expression is established. This may be 
accomplished by combining body fluids or cell extracts taken from normal 
subjects, either animal or human, with a sequence or a fragment thereof, which 
is specific for the nucleic acids encoding the proteins of the invention and 
homologous nucleic acids, under conditions suitable for hybridization or 

10 amplification. Standard hybridization may be quantified by comparing the 
values obtained from normal subjects with those from an experiment where a 
known amount of a substantially purified polynucleotide is used. Standard 
values obtained from normal samples may be compared with values obtained 
from samples from patients who are symptomatic for disease. Deviation 

15 between standard and subject values is used to establish the presence of 
disease. Once disease is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to evaluate whether 
the level of expression in the patient begins to approximate that, which is 
observed in the normal patient. The results obtained from successive assays 

20 may be used to show the efficacy of treatment over a period ranging from 
several days to months. 

With respect to pancreatic diseases (e.g. diabetes), obesity, and/or metabolic 
syndrome, the presence of an unusual amount of transcript in biopsied tissue 

25 from an individual may indicate a predisposition for the development of the 
disease or may provide a means for detecting the disease prior to the 
appearance of actual clinical symptoms. A more definitive diagnosis of this 
type may allow health professionals to employ preventative measures or 
aggressive treatment earlier thereby preventing the development or further 

30 progression of the metabolic diseases and disorders. 

Additional diagnostic uses for oligonucleotides designed from the sequences 
encoding the proteins of the invention and homologous proteins may involve 
the use of PCR. Such oligomers may be chemically synthesized, generated 
35 enzymatically or produced from a recombinant source. Oligomers will 
preferably consist of two nucleotide sequences, one with sense orientation 
(5prime.fwdarw.3prime) and another with antisense (3prime.rarw.5prime), 
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employed under optimized conditions for identification of a specific gene or 
condition. The same two oligomers, nested sets of oligomers or even a 
degenerate pool of oligomers may be employed under less stringent conditions 
for detection and/or quantification of closely related DNA or RNA sequences. 

5 

In another embodiment of the invention, the nucleic acid sequences may also 
be used to generate hybridization probes, which are useful for mapping the 
naturally occurring genomic sequence. The sequences may be mapped to a 
particular chromosome or to a specific region of the chromosome using well- 

10 known techniques. Such techniques include FISH, FACS or artificial 
chromosome constructions, such as yeast artificial chromosomes, bacterial 
artificial chromosomes, bacterial P1 constructions or single chromosome cDNA 
libraries as reviewed in Price, C. M. (1993) Blood Rev. 7:127-134, and Trask, 
B. J. (1991) Trends Genet. 7:149-154. FISH (as described in Verma et al. 

is (1988) Human Chromosomes: A Manual of Basic Techniques, Pergamon 
Press, New York, N.Y.). The results may be correlated with other physical 
chromosome mapping techniques and genetic map data. Examples of genetic 
map data can be found in the 1994 Genome Issue of Science (265: 1981f). 
Correlation between the location of the gene encoding the proteins of the 

20 invention on a physical chromosomal map and a specific disease or 
predisposition to a specific disease, may help to delimit the region of DNA 
associated with that genetic disease. 

The nucleotide sequences of the subject invention may be used to detect 
25 differences in gene sequences between normal, carrier or affected individuals. 
An analysis of polymorphisms, e.g. single nucleotide polymorphisms may be 
carried out. Further, in situ hybridization of chromosomal preparations and 
physical mapping techniques such as linkage analysis using established 
chromosomal markers may be used for extending genetic maps. Often the 
30 placement of a gene on the chromosome of another mammalian species, such 
as mouse, may reveal associated markers even if the number or arm of a 
particular human chromosome is not known. New sequences can be assigned 
to chromosomal arms or parts thereof, by physical mapping. This provides 
valuable information to investigators searching for disease genes using 
35 positional cloning or other gene discovery techniques. Once the disease or 
syndrome has been crudely localized by genetic linkage to a particular 
genomic region, for example, AT to 11q22-23 (Gatti, R. A. et al. (1988) Nature 
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336: 577-580), any sequences mapping to that area may represent associated 
or regulatory genes for further investigation. The nucleotide sequences of the 
subject invention may also be used to detect differences in the chromosomal 
location due to translocation, inversion, etc. among normal, carrier or affected 
5 individuals. 

In another embodiment of the invention, the proteins of the invention, their 
catalytic or immunogenic fragments or oligopeptides thereof, an in vitro model, 
a genetically altered cell or animal, can be used for screening libraries of 

10 compounds in any of a variety of drug screening techniques. One can identify 
effectors, e.g. receptors, enzymes, proteins, ligands, or substrates that bind to, 
modulate or mimic the action of one or more of the DG153 or DG177 proteins 
of the invention. The protein or fragment thereof employed in such screening 
may be free in solution, affixed to a solid support, borne on a cell surface, or 

15 located intracellular^ The formation of binding complexes, between the 
DG153 or DG177 proteins of the invention and the agent tested, may be 
measured. Agents could also, either directly or indirectly, influence the activity 
of the proteins of the invention. 

20 In addition activity of the proteins of the invention against their physiological 
substrate(s) or derivatives thereof could be measured in cell-based or cell-free 
assays. Agents may also interfere with posttranslational modifications of the 
protein, such as phosphorylation and dephosphorylation, farnesylation, 
palmitoylation, acetylation, alkylation, ubiquitination, proteolytic processing, 

25 subcellular localization and degradation. Moreover, agents could influence the 
dimerization or oligomerization of the proteins of the invention or, in a 
heterologous manner, of the proteins of the invention with other proteins, for 
example, but not exclusively, docking proteins, enzymes, receptors, or 
translation factors. Agents could also act on the physical interaction of the 

30 proteins of this invention with other proteins, which are required for protein 
function, for example, but not exclusively, their downstream signaling. 

Methods for determining protein-protein interaction are well known in the art. 
For example binding of a fluorescently labeled peptide derived from the 
35 interacting protein to the DG153 or DG177 protein of the invention, or vice 
versa, could be detected by a change in polarisation. In case that both binding 
partners, which can be either the full length proteins as well as one binding 
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partner as the full length protein and the other just represented as a peptide 
are fluorescently labeled, binding could be detected by fluorescence energy 
transfer (FRET) from one fluorophore to the other. In addition, a variety of 
commercially available assay principles suitable for detection of protein-protein 
Interaction are well known In the art, for example but not exclusively 
AlphaScreen (PerkinElmer) or Scintillation Proximity Assays (SPA) by 
Amersham. Alternatively, the interaction of the DG153 or DG177 proteins of 
the invention with cellular proteins could be the basis for a cell-based 
screening assay, in which both proteins are fluorescently labeled and 
interaction of both proteins is detected by analysing cotranslocation of both 
proteins with a cellular imaging reader, as has been developed for example, 
but not exclusively, by Cellomics or EvotecOAI. In all cases the two or more 
binding partners can be different proteins with one being the protein of the 
invention, or in case of dimerization and/or oligomerization the protein of the 
invention itself. 

Of particular interest are screening assays for agents that have a low toxicity 
for mammalian cells. The term "agent" as used herein describes any molecule, 
e.g. protein or pharmaceutical, with the capability of altering or mimicking the 
physiological function of one or more of the proteins of the invention. 
Candidate agents encompass numerous chemical classes, though typically 
they are organic molecules, preferably small organic compounds having a 
molecular weight of more than 50 and less than about 2,500 Daltons. 
Candidate agents comprise functional groups necessary for structural 
interaction with proteins, particularly hydrogen bonding, and typically include at 
least an amine, carbonyl, hydroxyl or carboxyl group, preferably at least two of 
the functional chemical groups. The candidate agents often comprise 
carbocyclic or heterocyclic structures and/or aromatic or polyaromatic 
structures substituted with one or more of the above functional groups. 

Candidate agents are also found among biomolecules including peptides, 
saccharides, fatty acids, steroids, purines, pyrimidines, nucleic acids and 
derivatives, structural analogs or combinations thereof. Candidate agents are 
obtained from a wide variety of sources including libraries of synthetic or 
natural compounds. For example, numerous means are available for random 
and directed synthesis of a wide variety of organic compounds and 
biomolecules, including expression of randomized oligonucleotides and 
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oligopeptides. Alternatively, libraries of natural compounds In the form of 
bacterial, fungal, plant and animal extracts are available or readily produced. 
Additionally, natural or synthetically produced libraries and compounds are 
readily modified through conventional chemical, physical and biochemical 

s means, and may be used to produce combinatorial libraries. Known 
pharmacological agents may be subjected to directed or random chemical 
modifications, such as acylation, alkylation, esterification, amidification, etc. to 
produce structural analogs. Where the screening assay is a binding assay, 
one or more of the molecules may be joined to a label, where the label can 

10 directly or indirectly provide a detectable signal. 

Another technique for drug screening, which may be used, provides for high 
throughput screening of compounds having suitable binding affinity to the 
protein of interest as described in published PCT application WO 84/03564. In 

15 this method, as applied to the proteins of the invention large numbers of 
different small test compounds, e.g. aptamers, peptides, low-molecular weight 
compounds etc., are provided or synthesized on a solid substrate, such as 
plastic pins or some other surface. The test compounds are reacted with the 
proteins or fragments thereof, and washed. Bound proteins are then detected 

20 by methods well known in the art. Purified proteins can also be coated directly 
onto plates for use in the aforementioned drug screening techniques. 
Alternatively, non-neutralizing antibodies can be used to capture the peptide 
and immobilize it on a solid support. In another embodiment, one may use 
competitive drug screening assays in which neutralizing antibodies capable of 

25 binding the protein specifically compete with a test compound for binding the 
protein. In this manner, the antibodies can be used to detect the presence of 
any peptide, which shares one or more antigenic determinants with the protein. 

Compounds that bind DG153 or DG177 proteins, e.g. antibodies, are useful for 
30 the identification or enrichment of cells, which are positive for the expression of 
the proteins of the invention, from complex cell mixtures. Such cell populations 
are useful in transplantation, for experimental evaluation, and as source of 
lineage and cell specific products, including mRNA species useful in identifying 
genes specifically expressed in these cells, and as target for the identification 
35 of factors of molecules that can affect them. Cells expressing the protein of the 
invention or which have been treated with the protein of the invention are 
useful in transplantation to provide a recipient with pancreatic islet cells, 
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including insulin producing beta cells; for drug screening; experimental models 
of islet differentiation and interaction with other cell types; in vitro screening 
assays to define growth and differentiation factors, and to additionally 
characterize genes involved in islet development and regulation; and the like. 
The native cells may be used for these purposes, or they may be genetically 
modified to provide altered capabilities. Cells from a regenerating pancreas, 
from embryonic foregut, stomach and duodenum, or other sources of 
pancreatic progenitor cells may be used as a starting population. The 
progenitor cells may be obtained from any mammalian species, e.g. equine, 
bovine, porcine, canine, feline, rodent, e.g. mice, rats, hamster, primate, etc. 
particularly human. 

In another embodiment, in a high-throughput screening method, the cells are 
transfected with a DNA construct, e.g. a viral or non-viral vector containing a 
reporter gene, e.g. the lacZ gene or the GFP gene, under regulatory control of 
a promoter of a gene involved in for example beta-cell differentiation, e.g. a 
promoter of a gene stimulation beta-cell differentiation, preferably a Pax4 
promoter. The transfected cells are divided into aliquots and each aliquot is 
contacted with a test substance, e.g., candidate 1 , candidate 2 and candidate 
3. The activity of the reporter gene corresponds to the capability of the test 
compound to induce beta-cell differentiation. 

In a further embodiment, which may be combined with the high-throughput 
screening as described above, a medium throughput validation is carried out. 
Therein, the test compound is added to stem cells being cultivated and the 
insulin production is determined. Following an initial high throughput assay, 
such as the cell based assay outlined above where for example a Pax4 
promoter is used as marker for beta-cell regeneration, the activity of candidate 
molecules to induce beta-cell differentiation is tested in a validation assay 
comprising adding said compounds to the culture media of the embryoid 
bodies. Differentiation into insulin-producing cells is then evaluated, e.g. by 
comparison to wild type and/or Pax4 expressing ES cells to assess the 
effectiveness of a compound. 

The nucleic acids encoding the DG153 or DG177 proteins of the invention can 
be used to generate transgenic cell lines and animals. These transgenic 
non-human animals are useful in the study of the function and regulation of the 
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proteins of the invention in vivo. Transgenic animals, particularly mammalian 
transgenic animals, can serve as a model system for the investigation of many 
developmental and cellular processes common to humans. A variety of 
non-human models of metabolic disorders can be used to test modulators of 
5 the protein of the invention. Misexpression (for example, over-expression or 
lack of expression) of the protein of the invention, particular feeding conditions, 
and/or administration of biologically active compounds can create models of 
metabolic disorders. 

10 In one embodiment of the invention, such assays use mouse models of insulin 
resistance and/or diabetes, such as mice carrying gene knockouts in the leptin 
pathway (for example, ob (leptin) or db (leptin receptor) mice), as described 
above. In addition to testing the expression of the proteins of the invention in 
such mouse strains (see Examples), these mice could be used to test whether 

is administration of a candidate modulator alters for example lipid accumulation 
in the liver, in plasma, or adipose tissues using standard assays well known in 
the art, such as FPLC, colorimetric assays, blood glucose level tests, insulin 
tolerance tests and others. 

20 Transgenic animals may be made through homologous recombination in non- 
human embryonic stem cells, where the normal locus of the gene encoding the 
protein of the invention is altered. Alternatively, a nucleic acid construct 
encoding a protein of the invention is injected into oocytes and is randomly 
integrated into the genome. Vectors for stable integration include plasmids, 

25 retroviruses and other animal viruses, yeast artificial chromosomes (YACs), 
and the like. The modified cells or animals are useful in the study of the 
function and regulation of the protein of the invention. For example, a series 
of small deletions and/or substitutions may be made in the gene that encodes 
the protein of the invention to determine the role of particular domains of the 

30 protein, functions in pancreatic differentiation, etc. 

Furthermore, variants of the genes of the invention like specific constructs of 
interest include anti-sense molecules, which will block the expression of the 
protein of the invention, or expression of dominant negative mutations. A 
35 detectable marker, such as lac Z or luciferase may be introduced into the locus 
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of the genes of the invention, where up-regulation of expression of the genes 
of the invention will result in an easily detected change in phenotype. 

One may also provide for expression of the genes of the invention or variants 
thereof in cells or tissues where it is not normally expressed or at abnormal 
times of development. In addition, by providing expression of the protein of the 
invention in cells in which they are not normally produced, one can induce 
changes in cell behavior. 

DNA constructs for homologous recombination will comprise at least portions 
of the genes of the invention with the desired genetic modification, and will 
include regions of homology to the target locus. DNA constructs for random 
integration do not need to contain regions of homology to mediate 
recombination. Conveniently, markers for positive and negative selection are 
included. DNA constructs for random integration will consist of the nucleic 
acids encoding the proteins of the invention, a regulatory element (promoter), 
an intron and a poly-adenylation signal. Methods for generating cells having 
targeted gene modifications through homologous recombination are known in 
the art. For non-human embryonic stem (ES) cells, an ES cell line may be 
employed, or embryonic cells may be obtained freshly from a host, e.g. mouse, 
rat, guinea pig, etc. Such cells are grown on an appropriate fibroblast-feeder 
layer and are grown in the presence of leukemia inhibiting factor (LIF). 

When non-human ES or embryonic cells or somatic pluripotent stem cells 
have been transfected, they may be used to produce transgenic animals. 
After transfection, the ceils are plated onto a feeder layer in an appropriate 
medium. Cells containing the construct may be selected by employing a 
selective medium. After sufficient time for colonies to grow, they are picked and 
analyzed for the occurrence of homologous recombination or integration of the 
construct. Those colonies that are positive may then be used for embryo 
transfection and morula aggregation. Briefly, morulae are obtained from 4 to 6 
weeks old superovulated females, the Zona Pellucida is removed and the 
morulae are put into small depressions of a tissue culture dish. The ES cells 
are trypsinized, and the modified cells are placed into the depression closely to 
the morulae. On the following day the aggregates are transfered into the 
uterine horns of pseudopregnant females. Females are then allowed to go to 



WO 2005/005471 



PCT7EP2004/007531 



-35- 

term. Chimeric offsprings can be readily detected by a change in coat color 
and are subsequently screened for the transmission of the mutation into the 
next generation (F1 -generation). Offspring of the F1 -generation are screened 
for the presence of the modified gene and males and females having the 
modification are mated to produce homozygous progeny, if the gene 
alterations cause lethality at some point in development, tissues or organs can 
be maintained as allogenic or congenic grafts or transplants, or in vitro culture. 
The transgenic animals may be any non-human mammal, such as laboratory 
animal, domestic animals, etc., for example, mouse, rat, guinea pig, sheep, 
cow, pig, and others. The transgenic animals may be used in functional 
studies, drug screening, and other applications and are useful in the study of 
the function and regulation of the proteins of the invention in vivo. 

Finally, the invention also relates to a kit comprising at least one of 

(a) a nucleic acid molecule coding for a protein of the invention or a 
functional fragment thereof; 

(b) a protein of the invention or a fragment or an isoform thereof; 

(c) a vector comprising the nucleic acid of (a); 

(d) a host cell comprising the nucleic acid of (a) or the vector of (c); 

(e) a polypeptide encoded by the nucleic acid of (a); 

(f) a fusion polypeptide encoded by the nucleic acid of (a); 

(g) an antibody, an aptamer or another effector/modulator against the 
nucleic acid of (a) or the polypeptide of (b), (e) or (f) and 

(h) an anti-sense oligonucleotide of the nucleic acid of (a). 

The kit may be used for diagnostic or therapeutic purposes or for screening 
applications as described above. The kit may further contain user instructions. 

The Figures show: 

Fig. 1 shows human DG153 nucleic acid and proteins. 

Fig. 1A shows the nucleic acid sequence of human DG153 protein (SEQ ID 
NO: 1; GenBank Accession Number M83751). 

Fig. 1B shows the amino acid sequence (one-letter code) of human DG153 
protein, longer variant (SEQ ID NO: 2; GenBank Accession Number 
AAB08753). 

Fig. 1C shows the amino acid sequence (one-letter code) of human DG153 
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protein, shorter variant (SEQ ID NO: 3; Swiss-Prot Accession Number 
P55145). 

Fig. 2 shows the analysis of DG153 protein expression in mammalian 
tissues and cells. The relative RNA-expression is shown on the Y-axis, in 
Fig. 2A to 2C, the tissues tested are given on the X-axis. WAT refers to 
white adipose tissue, BAT refers to brown adipose tissue. In Fig. 2D to 2F, 
the X-axis represents the time axis. 'dO' refers to day 0 (start of the 
experiment), "d10" refers to day 10 of adipocyte differentiation, "d12" refers 
to day 12 of adipocyte differentiation). 

Fig. 2A shows the quantitative (real-time PCR) analysis of DG153 expression 
in mouse wild-type tissues. 

Fig. 2B shows the quantitative (real-time PCR) analysis of DG153 expression 
in genetically obese mice (ob/ob-mice) and fasted mice (fasted-mice) 
compared to wild-type mice (wt-mice). 

Fig. 2C shows the quantitative (real-time PCR) analysis of DG153 expression 
in mice fed with a high fat diet compared to mice fed with a control diet. 
Fig. 2D shows the quantitative (real-time PCR) analysis of DG153 expression 
in mouse fibroblast (3T3-L1) cells during the differentiation from preadipocytes 
to mature adipocytes. 

Fig. 2E shows the microarray analysis of DG153 expression in mouse 
fibroblast (3T3-L1 ) cells during the differentiation from preadipocytes to mature 
adipocytes. 

Fig. 2F shows the microarray analysis of DG153 expression in human 
abdominal adipocyte cells, during the differentiation from preadipocytes to 
mature adipocytes. 

Fig. 3 shows human DG177 nucleic acid and protein. 

Fig. 3A shows the nucleic acid sequence of human DG177 protein (SEQ ID 
NO: 4; GenBank Accession Number NM_007085). 

Fig. 3B shows the amino acid sequence (one-letter code) of human DG177 
protein (SEQ ID NO: 5; GenBank Accession Number NP_009016). 

Fig. 4 shows the quantitative (real-time PCR) analysis of DG177 protein 
expression in mammalian tissues and cells. The relative RNA-expression is 
shown on the Y-axis, in Fig. 4A to 4D the tissues tested are given on the 
X-axis. WAT refers to white adipose tissue, BAT refers to brown adipose 
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tissue. In Fig. 4E, the X-axis represents the time axis. 'dO' refers to day 0 
(start of the experiment), 'd2' - 'd10' refers to day 2 - day 10 of adipocyte 
differentiation). 

Fig. 4A shows the analysis of DG177 expression in wild-type mouse tissues. 
Fig. 4B shows the analysis of DG177 expression in genetically obese mice 
(ob/ob-mice) and fasted mice (fasted-mice) compared to wild-type mice 
(wt-mice). 

Fig. 4C shows the analysis of DG177 expression in mice fed with a high fat 
diet compared to mice fed with a control diet. 

Fig. 4D shows the analysis of DG177 expression in mouse tissues from 

non-obese-diabetic (NOD) mice compared to wild-type mice. 

Fig. 4E shows the analysis of DG177 expression in mammalian fibroblast 

(3T3-L1) cells during the differentiation from preadipocytes to mature 

adipocytes. 

The examples illustrate the invention: 

Example 1: Identification of secreted factors expressed in pancreas 

A screen for secreted factors expressed in developing mouse pancreas was 
carried out according to methods known by those skilled in the art (see, for 
example Pera E.M. and De Robertis E.M., (2000) Mech Dev 96: 183-195) with 
several modifications. 

Expression cDNA library: 

During organogenesis, the pancreatic bud is surrounded and influenced by the 
associated mesenchyme, (see for example, Madsen O.D. et al., (1996) Eur. J. 
Biochem. 242: 435-445 and Slack, J.M., (1995) Development 121: 1569-1580). 
Recently, it was suggested, that white adipocytes origin directly from 
mesenchymal cells (Atanossova P.K., (2003) Folia Med. 45: 41-45). During 
embryogenesis, the innervation and vascularization of the pancreas can be 
observed. Therefore, the tissue used in the screen might have contained 
besides pancreatic cells some adipocyte precursors, blood vessels, as well as 
neuronal cells. 
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A mouse embryonic stage 9.5-15 pancreatic bud library was prepared in 
pCMVSPORT-6 vector using SUPERSCRIPT Plasmid System from Invitrogen 
according to the manufacturer's instructions. The non-amplified library was 
electroporated into MaxEff DH10B cells (Invitrogen). 

Secretion cloning 

Bacterial clones were picked with sterile toothpicks from agar plates and 
cultured in 96-deep-well microtiter plates in LB-ampicillin (see Sambrook et al., 
supra). Aliquots of 8 cultures were pooled, and plasmid DNAwas isolated using 
the BioRobot_9600 apparatus according to the manufacturer's instructions 
(Qiagen; QIAprep(r) Turbo BioRobot Kit. Human 293 cell culture cells were 
cultured in 75 ml tissue culture flasks in DMEM and 10% fetal calf serum. At 
90-99% confluence, the cells were splitted at 1:3 ratio and plated onto 
poly-D-lysine (Sigma) coated 96-well plates. Cells were transfected with 
100-500 ng plasmid using lipofectamine 2000 (Invitrogen). After 6 hours, the 
medium was exchanged for fresh complete growth medium. 24 hours after 
transfectjon, the cells were washed twice with DMEM without cysteine and 
methionine (Invitrogen), supplemented with 1% dialysed Bovine serum (Sigma) 
with 50 microgram per ml Heparin (Sigma) and glutamine. The cells were 
labeled radioactively ('S35 Met-label', from Hartmann Analytic GmbH). After 12 
hours, aliquots of the supernatants were harvested in 96-well PCR plates and 
subjected to SDS gel electrophoresis in precast 420% gradient polyacrylamide 
Criterion gels (Biorad) under reducing conditions, using Criterion Dodeca Cell 
gel running chamber (Biorad). The gels were fixed in 10% acetic acid, 25% 
isopropanol for 30 min, soaked 15-30 min in AMPLIFY reagent (Amersham), 
dried and exposed to X-OMAT (AR) film (Kodak). Positive clones were 
identified and regrown in 96-well-plates. DNA of individual clones was 
prepared and used for transfection as described above. If one of the clones 
yielded proteins of the same size as that of the original pool, a positive clone 
was identified. Positive clones were partially sequenced from the 5' end 
(SEQLAB, Goettingen). 

Example 2: Identification of the human DG153 or DG177 homologous 
nucleic acid and proteins 

The term "polynucleotide comprising the nucleotide sequence as shown in 
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GenBank Accession number" relates to the expressible gene of the nucleotide 
sequences deposited under the corresponding GenBank Accession number. 
The term "GenBank Accession number" relates to NCBI GenBank database 
entries (Ref.: Benson et ah, Nucleic Acids Res. 28 (2000) 15-18). 

DG153 or DG177 homologous proteins and nucleic acid molecules coding 
therefore are obtainable from insect or vertebrate species, e.g. mammals or 
birds. Particularly preferred are nucleic acids comprising human DG153 or 
DG177 homologs. The following mouse sequences were identfied in the 
secreted factor screen': Mus musculus arginine-rich, mutated in early stage 
tumors (Armet), GenBank Accession Number NM_029103 (838 base pairs 
mRNA) and GenBank Accession Number NP_083379" (179 amino acid 
protein)), and Mus musculus follistatin-like 1 (FstM), GenBank Accession 
Number NM_008047 and GenBank Accession Number NP_032073. 

Sequences homologous to mouse DG153 or DG177 were identified using the 
publicly available program BLASTP 2.2.3 of the non-redundant protein data 
base of the National Center for Biotechnology Information (NCBI) (see, 
Altschul et al., 1997, Nucleic Acids Res. 25: 3389-3402). The best human 
homologs of mouse DG153 are Homo sapiens arginine-rich protein (ARP) 
gene GenBank Accession Number M83751 (1103 base pairs mRNA; SEQ ID 
NO: 1 ; see Fig. 1 A), GenBank Accession Number AAB08753.1 (234 amino 
acid protein; SEQ ID NO: 2; see Fig. 1B), and Swiss-Prot Accession Number 
P55145 (179 amino acid protein; SEQ ID NO: 3; see Fig. 1C). The 234 amino 
acid protein is a longer sequence with arginine-rich regions thought to be the 
target of cancer-causing mutation. The best human homolog of mouse DG177 
is GenBank Accession Number NM_007085 (SEQ ID NO: 4; see Fig. 3A) and 
GenBank Accession Number NP_009016 (SEQ ID NO: 5; see Fig. 3B). 

Example 3: Expression of the polypeptides in mammalian tissues 

To analyse the expression of the polypeptides disclosed in this invention in 
mammalian tissues, several mouse strains (preferably mice strains C57BI/6J, 
C57BI/6 ob/ob and C57BI/KS db/db, and Non-Obese-Diabetic (NOD) mice 
which are standard model systems in obesity and diabetes research) were 
purchased from Harlan Winkelmann (33178 Borchen, Germany) and Taconic 
M & B (Germantown, NY 12526, U.S. A), respectively, and maintained under 
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constant temperature (preferably 22°C), 40 per cent humidity and a light/dark 
cycle of preferably 14/10 hours. The mice were fed a standard chow (for 
example, from ssniff Spezialitaten GmbH, order number ssniff M-Z 
V1 126-000). For the fasting experiment ("fasted wild-type mice"), wild-type 
mice were starved for 48 h without food, but only water supplied ad libitum 
(see, for example, Schnetzler et al., (1993) J Clin Invest 92: 272-280, Mizuno 
et al., (1996) Proc Natl Acad Sci 93: 3434-3438). In a further experiment wild- 
type (wt) mice were fed a control diet (preferably Altromin C1057 mod 
control, 4.5% crude fat) or high fat diet (preferably Altromin C1057mod. high 
fat, 23.5% crude fat). Animals were sacrificed at an age of 6 to 8 weeks. The 
animal tissues were isolated according to standard procedures known to those 
skilled in the art, snap frozen in liquid nitrogen and stored at -80°C until 
needed. 

For analyzing the role of the proteins disclosed in this invention in the in vitro 
differentiation of different mammalian cell culture cells for the conversion of 
pre-adipocytes to adipocytes, mammalian fibroblast (3T3-L1) cells (e.g., Green 
& Kehinde, Cell 1: 113-116, 1974) were obtained from the American Tissue 
Culture Collection (ATCC, Hanassas, VA, USA; ATCC- CL 173). 3T3-L1 cells 
were maintained as fibroblasts and differentiated into adipocytes as described 
in the prior art (e.g., Qiu. et al., J. Biol. Chem. 276: 11988-11995, 2001; Slieker 
et al., BBRC 251: 225-229, 1998). In brief, cells were plated in DMEM/10% 
FCS (Invitrogen, Karlsruhe, Germany) at 50,000 cells/well in duplicates in 
6-well plastic dishes and cultured in a humidified atmosphere of 5% CO z at 37° 
C. At confluence (defined as day 0: dO) cells were transferred to serum-free 
(SF) medium, containing DMEM/HamF12 (3:1; Invitrogen), fetuin (300 ug/ml; 
Sigma, Munich, Germany), transferrin (2 ug/ml; Sigma), pantothenate (17 uM; 
Sigma), biotin (1 uM; Sigma), and EGF (0.8nM; Hoffmann-La Roche, Basel, 
Switzerland). Differentiation was induced by adding dexamethasone (DEX; 1 
uM; Sigma), 3-methyl-isobutyl-1-methylxanthine (MIX; 0.5mM; Sigma), and 
bovine insulin (5 ug/ml; Invitrogen). Four days after confluence (d4), cells were 
kept in SF medium, containing bovine insulin (5 ug/ml) until differentiation was 
completed. At various time points of the differentiation procedure, beginning 
with day 0 (day of confluence) and day 2 (hormone addition; for example, 
dexamethasone and 3-isobutyl-1-methylxanthine), up to 10 days of 
differentiation, suitable aliquots of cells were taken every two days. 
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RNA was isolated from mouse tissues or cell culture cells using Trizol Reagent 
(for example, from Invitrogen, Karlsruhe, Germany) and further purified with the 
RNeasy Kit (for example, from Qiagen, Germany) in combination with an 
DNase-treatment according to the instructions of the manufacturers and as 

5 known to those skilled in the art. Total RNA was reverse transcribed 
(preferably using Superscript II RNaseH- Reverse Transcriptase, from 
Invitrogen, Karlsruhe, Germany) and subjected to Taqman analysis preferably 
using the Taqman 2xPCR Master Mix (from Applied Biosystems, Weiterstadt, 
Germany; the Mix contains according to the Manufacturer for example 

10 AmpliTaq Gold DNA Polymerase, AmpErase UNG, dNTPs with dUTP, passive 
reference Rox and optimized buffer components) on a GeneAmp 5700 
Sequence Detection System (from Applied Biosystems, Weiterstadt, 
Germany). 

15 The following prime/probe pairs were used for the TaqMan analysis (GenBank 
Accession Number NM_029103 for the mouse DG153 sequence): Mouse 
DG153 forward primer (Seq ID NO:6): 5'- AGA GAA TCG GTT GTG CTA CTA 
CAT TG -3'; mouse DG153 reverse primer (Seq ID NO:7): 5'- GGC TTC GAC 
ACC TCA TTG ATG -3'; mouse DG153 Taqman probe (Seq ID NO:8): 

20 (5/6-FAM) AGC CAC AGA TGA TGC TGC CAC CAA (5/6-TAMRA). 

The following prime/probe pairs were used for the TaqMan analysis (GenBank 
Accession Number NM_008047 for the mouse DG177 sequence): Mouse 
DG177 forward primer (Seq ID NO:9): 5'- GAA GTC TGC GAG TCC ATC 
25 TGC -3'; mouse DG177 reverse primer (Seq ID NO:10): 5'- GCG CCG TCG 
GAG CTC -3'; mouse DG177 Taqman probe (Seq ID NO:1 1): (5/6-FAM)- AGC 
CCA GTT GTC TGC TAT CAA GCT AAC CG- (5/6-TAMRA). 

The function of the mammalian DG153 and DG177 in metabolism was further 
30 validated by analyzing the expression of the transcripts in different tissues and 
by analyzing the role in adipocyte differentiation. 

Expression profiling studies confirm the particular relevance of DG153 and 
DG177 as regulators of energy metabolism in mammals. Quantitative PCR 
35 (Taqman) analysis revealed that DG153 is expressed in several mammalian 
tissues, with highest expression levels in testis in wild type mice. In addition, 
DG153 is highly expressed in metabolic active tissue such as white adipose 
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tissue (WAT) and at lower levels in brown adipose tissue (BAT) compared to 
other tissue types in wild type mouse as depicted in Fig. 2A DG177 is 
expressed in several mammalian tissues, with highest expression levels in 
white adipose tissue (WAT) in wild type mice. In addition, DG177 is also highly 
expressed in muscle, heart, lung, and at lower, but robust levels in brown 
adipose tissue (BAT), hypothalamus and brain in wild type mice, as depicted in 
Fig. 4A 

We used mouse models of insulin resistance and/or diabetes, such, as mice 
carrying gene knockouts in the leptin pathway (for example, ob/ob (leptin) or 
db (leptin receptor/ligand) mice) to study the expression of DG153 and DG177. 
Such mice develop typical symptoms of diabetes, show hepatic lipid 
accumulation and frequently have increased plasma lipid levels (see Bruning 
et al, 1998, Mol. Ceil. 2: 559-569). We found, for example, that the expression 
of DG153 is down-regulated in metabolic active tissue (WAT) in genetically 
induced obese mice (ob/ob) compared to wild type mice (see Fig. 2B). We 
found, that the expression of DG177 is significantly up-regulated in metabolic 
active tissue (BAT) in genetically induced obese mice (ob/ob) compared to wild 
type mice (see Fig. 4B). We also found, that the expression of DG177 is down- 
regulated in WAT and liver of fasted wild type mice, compared to wild type 
mice fed a standard diet (Fig. 4B). 

Expression of DG153 and DG177 mRNA was also examined in susceptible 
wild type mice (for example, C57BI/6) that show symptoms of diabetes, lipid 
accumulation, and high plasma lipid levels, if fed a high fat diet. In those mice, 
the expression of DG153 is significantly down-regulated in WAT and up- 
regulated in BAT and muscle (see Fig. 2C) supporting that DG153 is involved 
in the regulation of mammalian metabolism. The expression of DG177 is 
significantly up-regulated in WAT and muscle (see Fig. 4C) supporting that 
DG177 is involved in the regulation of mammalian metabolism. 

In non-obese diabetic (NOD) mice, expression of DG177 is significantly down- 
regulated in white adipose tissue compared to wild type mice, as shown in Fig. 
4D. 
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We show in this invention that DG153 is expressed and decreased during the 
differentiation of preadipocytes into mature adipocyctes (Fig. 2D). We further 
show (see Fig. 4E) that the DG177 mRNA is expressed and down-regulated 
during the differentiation of preadipocytes into mature adipocyctes. Therefore, 
the DG153 and DG177 protein might also play an essential role in 
adipogenesis. 

Example 4. Analysis of the differential expression of transcripts of the 
proteins of the invention in mammalian tissues 

RNA preparation from murine 3T3-L1 cells and human primary adipose tissues 
was done as described in Example 3. The target preparation, hybridization, 
and scanning was performed as described in the manufactures manual (see 
Affymetrix Technical Manual, 2002, obtained from Affmetrix, Santa Clara, 
USA). 

The expression analysis (using Affymetrix GeneChips) of the DG153 gene 
using 3T3-L1 and primary human abdominal adipocycte differentiation clearly 
shows differential expression of murine and human DG153 in adipocytes. Two 
independent experiments were done, respectively. The experiments show that 
the DG153 transcripts are most abundant at day 0 compared to day 10 / day 
12 during differentiation (Fig. 2E and Fig. 2F). These data further confirm the 
mouse 3T3-L1 differentiation data shown in Fig. 2D. 

Thus, the DG153 protein has to be slightly decreased in order for the 
preadipocyctes to differentiate into mature adipocycte. The DG153 protein in 
preadipocyctes has the potential to inhibit adipose differentiation. Therefore, 
the DG153 protein might play an essential role in the regulation of human 
metabolism, in particular in the regulation of adipogenesis and thus it might be 
an essential role in pancreatic diseases (e.g. diabetes), obesity, and/or 
metabolic syndrome. 

For the purpose of the present invention, it will understood by the person 
having average skill in the art that any combination of any feature mentioned 
throughout the specification is explicitly disclosed herewith. 



